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VERIFIED STATEMENT (DECLARATION) CLAIMING SMALL ENTITY STATUS 
(37 CFR 1.9(f) and 1.27 (d) --SMALL BUSINESS CONCERN 



I hereby declare that I am: 

the owner of the small business concern identified below: 

X an official of the small business concern empowered to 
act on behalf of the concern identified below: 

Name of Concern: RAMOT- UNIVERSITY AUTHORITY FOR APPLIED RESEARCH 

AND INDUSTRIAL DEVELOPMENT , LTD. 

Address of Concern: 32 Haim Levanon Street - P.O. Box 39296 

Tel -Aviv 61392 Israel 



I hereby declare that the above identified small business concern qualifies as a 
small business concern as defined in 13 CFR 121.3-18, and reproduced in 37 CFR 
1.9(d), for purposes of paying reduced fees under Section 41(a) and (b) of Title 
35/' United States Code, in that the number of employees of the concern, including 
those of its affiliates, does not exceed 500 persons. For purposes of this 
statement: (1) the number of employees of the business concern is the average 
over the previous fiscal year of the concern of the persons employed on a full- 
time, part-time or temporary basis during each of the pay periods of the fiscal 
year, and (2) concerns are affiliates of each other when, either directly or 
indirectly, one concern controls or has the power to control the other, or a 
third-party or parties controls or has the power to control both. 

I hereby declare that rights under contract or law have been conveyed to and 
remain with the small business concern identified above with regard to the 
invention referenced above. 
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the specification filed herewith. 

application referenced above* 

patent referenced above. 
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If the rights heir 1 "^y the above- identified small br less concern are not 
exclusive, each ii idual, concern or organization iving rights to the 
'"invention is listed below* and no rights to the invention are held by any person, 
other than the inventor, who would not qualify as an independent inventor under 
37 CFR 1.9(c), if that person made the invention, or by any concern which would 
not qualify as a small business concern under 37 CFR 1.9(d) or a nonprofit 
organization under 37 CFR 1.9(e) . 

* NOTE: Separate verified statements are required from each 
named person, concern or organization having rights to 
the invention averring to their status as small entities . 
(37 CFR 1.27} 

NAME: 

ADDRESS: 



Individual Small Business Nonprofit Organization 



NAME: 
ADDRESS : 



Individual Small Business Nonprofit Organization 

I acknowledge the duty to file, in this application or patent, notification of 
any change in status resulting in loss of entitlement to small entity status 
prior to paying, or at the time of paying, the earliest of the issue fee or any 
maintenance fee due after the date on which status as a small entity is no longer 
appropriate. [37 CFR 1.28(b)] 

I hereby declare that all statements made herein of my own knowledge are true and 
that all statements made on information and belief are believed to be true; and 
further that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment r or both, 
under Section 1001 of Title 18 of the United States Code, and that such willful 
false statements may jeopardize the validity of the application, any patent 
issuing thereon, or any patent to which this verified statement is directed. 
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IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



In re application of: Eugene Rosenberg, et al 

Continuation of United States Patent 
Application No. 09/240,537, filed: 01/29/99 

Filed: Herewith 

For: GENE CLUSTER 

Attorney Docket No. 2290.00101 

PRELIMINARY AMENDMENT 

Assistant Commissioner for Patents 
Washington, D.C. 20231 
Box Patent Application 

Dear Sir: 

Please preliminarily amend the above-captioned Continuation patent 
application prior to examination as follows: 

IN THE SPECIFICATION: 

Page 1, in the "Cross-Reference to Related Application Section", after 
"This is a", please insert: 

-Continuation application of United States Patent Application Serial 
No.: 09/240,537, filed January 29, 1999, all of which is incorporated herein by 
reference—. 

Page 3, line 17, please delete "1 and". 

Page 4, line 12, after "DNA", please insert -and amino acid-. 
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IN THE CLAIMS: 

1. (Twice Amended). A purified, isolated and cloned DNA or amino 
acid sequence encoding a polypeptide required for the synthesis of antibiotic TA [or a 
shorter polypeptide portion of said polypeptide] , said polypeptide being selected 
from the group consisting essentially of SEQ ID NOS 1-19 and analogs thereof . 

3. (Twice Amended). A purified, isolated and cloned DNA or amino 
acid sequence consisting of a [DNA] sequence encoding a polypeptide required for 
post modification of antibiotic TA [or a shorter polypeptide portion of said polypeptide] 
said polypeptide being selected from the group consisting essentially of SEQ ID NOS 
1 -1 9 and analogs thereof . 

5. (Amended) A purified, isolated and cloned DNA or amino acid 
sequence consisting of a [DNA] sequence encoding a gene product involved in a 
regulation of the biosynthesis of antibiotic TA said polypeptide being selected from 
the group consisting essentially of SEQ ID NOS 1-19 and analogs thereof . 

7. (Twice Amended) A purified, isolated and cloned DNA sequence 
consisting of a DNA sequence as set forth in SEQ ID NO: [1 and] 2. 
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8. (Twice Amended) The DNA sequence of SEQ ID NO: [1 and] 2 
altered by point mutations, deletions or insertions such that the resulting amino acid 
sequence is shortened. 

Claim 9, line 1, please delete "1 or". 

1 5. (Twice Amended) A method of combinatorial genetics using the 
TA gene s as set forth in SEQ ID NOS 1-19 f or use in combinatorial genetics. 

16. (Twice Amended) A method of encoding for the synthesis, 
modification or regulation of antibiotic TA by using a TA gene as set forth in SEQ ID 
NOS 1-19 for encoding for the synthesis, modification or regulation of antibiotic TA. 

REMARKS 

Claims 1-16 are currently pending in the application. Claims 1, 3, 5, 7, 
8, 9, 1 1 and 16 are in independent form. 

The Office Action states that the Information Disclosure Statement filed 
on February 15, 2000 fails to comply with 37 CFR 1.198(a)(2), which requires a 
legible copy of each U.S. and foreign patent and each publication which is listed. 
Copies of the missing references are attached hereto. 
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Claims 1-6 stand rejected under 35 U.S.C. § 112, first paragraph, as 
containing subject matter which was not described in the specification in such a way 
as to reasonably convey to one skilled in the relevant art that the inventors, at the 
time the invention was filed, had possession of the claimed invention. 

The Office Action states that the instant claims are directed to DNA 
sequences encoding or partially encoding polypeptides for the synthesis, post- 
modification, and/or regulation of the antibiotic TA where the claimed products 
are defined by their functional characteristics. However, the Office Action holds 
that in claims to genetic material a generic statement such as "vertebrate insulin 
cDNA" without more is not an adequate written description of the genus since it 
does not distinguish the genus from others, except by function. The Office Action 
concludes that one skilled in the art cannot visualize or recognize the identity of 
the members of the genus. However, the claims as pending do state that there 
must be present a specific polypeptide which is utilized in the synthesis of 
antibiotic TA. This statement does sufficiently describe a structural feature 
commonly possessed by members of the genus such that the members of the 
genus must include therein at least one polypeptide which is utilized in the 
synthesis, post-modification or regulation of the antibiotic TA. Accordingly, 
reconsideration of the rejection is respectfully requested. 
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Claims 7-9 and 10-14, stand rejected under 35 U.S.C. Section 112, 
second paragraph, as being indefinite for failing to particularly point out and 
distinctly claim the subject matter which Applicants regard as the invention. 
Specifically, the Office Action states that in the claims the sequence numbers are 
referred to as DNA sequences, while in the Sequence Listing they are referred to 
as amino acid sequences. This was an error found in the Sequence Listing, which 
error has been remedied with the attached Sequence Listing. This correction thus 
obviates the present rejection. 

The Office Action states that claims 15 and 16 are rejected under 35 
U.S.C. Section 101 because the claimed recitation of a use, without setting forth 
any further steps involved in the process, results in an improper definition of a 
process. Accordingly, both claims 15 and 16 have been amended to either recite 
a proper method claim or the language has been amended to no longer recite a 
method claim. Reconsideration of the rejection is respectfully requested. 

Claims 1-2 stand rejected under 35 U.S.C. Section 102(b) as being 
anticipated by general scientific knowledge. The Office Action states that in claim 
1, line 1, the claim to "DNA sequences partially encoding... polypeptides" without 
defining the term "partially" claims fragments as small as three nucleotides, or a 
single code, encoding one amino acid would be included in the claim. 
Furthermore, the Office Action states that the nucleotide database of GenBank 
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contains greater than 100 nucleotide sequences from the cited species at the 
time the invention was made, all of which anticipate claims 1-2. However, when 
read more specifically, none of the published sequences were required for the 
biosynthesis or post-modification of antibiotic TA. Additionally, there were no 
cited references teaching the claimed sequences. As a matter of law, there must 
be a reference cited which teaches subject matter even if the subject matter is 
held to be in the general knowledge. That is, any holding of a limitation being in 
the general knowledge must be supported by a citation. Since the prior art does 
not disclose any sequences for the biosynthesis or post-modification of antibiotic 
TA as recited in pending claims 1 and 2, the claims are not anticipated by the 
cited general scientific knowledge and reconsideration of the rejection is 
respectfully requested. 

Claim 8 stands rejected under 35 U.S.C. Section 102(b), as being 
anticipated by general scientific knowledge. The Office Action cites that in claim 
8, line 2, the claim recites DNA sequences resulting in truncated amino acid 
sequences. The Office Action states that without any further limitation of the term 
"truncated", this claim language broadly encompasses DNA fragments as small 
as three nucleotides, or a single code on which sequences are found throughout 
scientific literature. Claim 8 has been amended in order to further prosecution, to 
remove the term "truncated". Additionally, when read more specifically, none of 
the published sequences were required for the biosynthesis or post-modification 
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of antibiotic TA. As this requirement is recited in the claim language of pending 
claims 1 and 2, these claims are not anticipated by the cited general scientific 
knowledge. Reconsideration of the rejection is respectfully requested. 

It is respectfully requested that the present amendment be entered 
in order to place the application in condition for allowance or at least in better 
condition for appeal. The application is placed in condition for allowance as it 
addresses and resolves each and every issue that remains pending. The 
amendments overcoming the rejections under 35 USC 112 are made exactly as 
suggested by the Office Action. Claims have also been amended to clearly 
distinguish over the prior art. The application is made at least in better condition 
for appeal as the amendment removes many issues thereby simplifying the 
issues on appeal. That is, each and every rejection under 35 USC 112 has been 
overcome exactly as suggest in the Office Action. Further, the claims have been 
amended to more specifically define the invention while raising no new issues 
which would require any further searching. Rather, the amendments have been 
made in view of comments made in the Office Action which clearly distinguish the 
presently pending claims over the cited prior art. Hence, it is respectfully 
requested that the amendment be entered. 
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ln conclusion, it is respectfully requested that the present 
amendment be entered in order to place the application in condition for 
allowance, which allowance is respectfully requested. 

The Commissioner is authorized to charge any fee or credit any 
overpayment in connection with this communication to our Deposit Account No. 



11-1449. 



Respectfully submitted, 



KOHN& ASSOCIATES 



Amy E. Rinateter 
Registration No. 45,791 
30500 Northwestern Hwy. Ste. 410 
Farmington Hills, Michigan 48334 
(248) 539-5050 




Dated: November 1 0, 2000 
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Mail Po§t Office to Addressee" addressed to the Assistant Commissioner for Patents, Washington, D.C. 
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GENE CLUSTER 

BACKGROUND OF THE INVENTION 

5 Polyketides constitute a large and highly diverse group of secondary 

metabolites synthesized by bacteria, fungi and plants, with a broad range of biological 
activities and medical applications. They include anti-cancer agents (Daunorubicin), 
antibiotics (tetracyclines, erythromycin etc.), immunosuppressants (macrolide FK506) 
and compounds with mycotoxic activity (aflatoxins, ochratoxins, ergochromes, 

10 patulin etc.). Polyketides are synthesized by repetitive condensations of acetate or 
propionate monomers in a similar way to that of fatty acid biosynthesis. Structural 
diversity of polyketides is achieved through different thioester primers, varying chain 
extension units used by the polyketide synthases (PKSs), and variations in the 
stereochemistry and the degree of reduction of intermediates. Diversity is also 

15 achieved by subsequent processing, such as alkylations, oxidations, O-methylations, 
glycosylations and cyclizations. Genetic studies indicated that gene organization of 
functional units and motif patterns of various PKSs are similar. This similarity was 
used to identify and obtain new PKS systems in both gram negative and gram positive 
bacteria. 

20 

PKS systems are classified into two types: type I PKSs are large, 
multifunctional enzymes, containing a separate site for each condensation or 
modification step. These represent "modular PKSs" in which the functional domains 



-1- 



2250 . 00075 

encoded by the DNA sequence are usually ordered parallel to the sequence of 
reactions carried out on the growing polyketide chain. Type II PKSs are systems 
made up of individual enzymes, in which each catalytic site is used repeatedly during 
the biosynthetic process. 

Genetic studies on prokaryotic PKSs have focused on gram positive 
microorganisms, particularly on actinomycetes. Mycobacteria are gram negative 
bacteria that produce a large number of secondary metabolites, including polyketides. 
Myxococcus xanthus produces TA (Rosenberg, et al., 1973; Rosenberg, et al., 1984), 
which is an antibacterial antibiotic. 

The polyketide antibiotic Tel- Aviv (hereinafter TA) (Rosenberg, et al., 1973) 
is synthesized by the gram negative bacterium Myxococcus xanthus in a unique multi- 
step process incorporating a glycine molecule into the polyketide carbon chain, which 
is elongated through the condensation of 11 acetate molecules by a type I polyketide 
synthase (PKSs), 

The antibiotic TA was crystallized and its chemical properties were 
determined. It is a macrocyclic polyketide synthesized through the incorporation of 
acetate, methionine, and glycine. It inhibits cell wall synthesis by interfering with the 
polymerization of the lipid-disaccharide-pentapeptide and its ability to adhere avidly 
to tissues and inorganic surfaces makes it potentially useful in a wide range of clinical 
applications, such as treating gingivitis. 
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A growing interest in the study of PKS systems and peptide synthetase 
systems 3tems from the need to develop new potent biologically active compounds. 
The use of combinatorial genetics in both systems (PKS and peptide synthetase) 
separately has led to the production of new polyketides and new peptides. 

5 

It would therefore by useful to be able to generate new biological agents from 
secondary metabolites of the antibiotic TA. 

SUMMARY OF THE INVENTION 

10 

According to the present invention, there is provided a purified, isolated and 
cloned DNA sequence partially encoding a functional portion of a polypeptide 
component required for the synthesis of antibiotic TA. Also provided are purified, 
isolated and cloned DNA sequences encoding a polypeptide component required for 

15 postmodification of antibiotic TA and encoding a gene product involved in the 
regulation of the biosynthesis of antibiotic TA. A purified, isolated and cloned DNA 
sequence having a DNA sequence (Seq. ID No:l and 2) encoding a polypeptide 
component required for encoding the TA gene cluster and any mutations thereof is 
provided. Also provided are methods of using the TA genes for combinatorial 

20 genetics and of using the TA genes encoding for synthesis and modification or 
regulation of antibiotic TA. 
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DESCRIPTION OF THE DRAWING 

Other advantages of the present invention will be readily appreciated as the 
same becomes better understood by reference to the following detailed description 
5 when considered in connection with the accompanying drawing wherein: 

Figure 1 shows the physical maps of the DNA regions involved in TA 
synthesis. 

1 0 DETAILED DESCRIPTION OF THE INVENTION 

The present invention consists of a DNA sequence of at least 42 kb encoding 
genes involved in TA production and Myxococcus xanthus as best shown in Seq. ID 
No:l through 17 and cosmid clones containing the entire TA gene DNA sequences. 
15 The TA gene cluster has been purified, isolated, and cloned. The purification, 
isolation and cloning was done according to the methods described in Marshak et al, 
"Strategies for Protein Purification and Characterization. A laboratory course 
manual." CSHL Press, 1996. 

20 A DNA fragment of at least 42 kb (Figure 1), encoding genes involved in TA 

production in Myxococcus xanthus has been identified, cloned and analyzed. These 
steps were done in accordance with Marshak et al s "Strategies for Protein Purification 
and Characterization. A laboratory course manual." CSHL Press, 1996. This 
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fragment contains a large region of about 20 kb, encoding the genes responsible for 
the regulation and the post-modification of TA. An additional fragment of 
approximately 8-10 kb located 10-20 kb downstream of the post-modification region, 
encodes the enzyme responsible for the incorporation of the glycine into the 
5 polyketide chain. This novel polypeptide is made up of a peptide synthetase unit 
lying between two PKS modules. 

The potential of this unique polypeptide in combining the two systems can 
lead to a new family of compounds, emerging from various combinations which can 
10 be utilized for combinatorial genetics. Such utilization can produce, for example, new 
bioactive agents, new polyketides and new peptides. Additionally, the TA gene 
cluster can be utilized in a method for the synthesis, modification or regulation of the 
TA antibiotic. 

15 Mutations imparting defects into the TA gene cluster can be point mutations, 

deletions or insertions. The mutations can occur within the nucleotide sequence of the 
allele of the TA gene cluster such that the resulting amino acid sequence of the TA 
gene cluster product is altered. 

20 In one embodiment of the present invention, the TA gene cluster can be 

included in a vector or recombinant expression vector. This vector containing the TA 
gene cluster is able to transform a suitable eucaryotic or procaryotic host cell. A 
suitable host cell can be determined by one skilled in the art. An example of a 
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suitable cell which can be transformed by the TA gene cluster is an E. coli cell. 

In another embodiment of the present invention, the a DNA fragment encoding 
the TA gene cluster can be cloned into a cosmid, as shown in Figure L This DNA 
5 fragment contains a large region of about 20kb, encoding the genes responsible for the 
regulation and the post-modification of TA. An additional fragment of approximately 
eight to ten kb is located 1 0-20 kb downstream of the post-modification region and 
encodes the enzyme responsible for the incorporation of the glycine into the 
polyketide chain. The novel polyketide chain is made up of a peptide synthetase unit 
10 lying between two PKS modules (See Figure 1). 

The above discussion provides a factual basis for the use of the TA gene 
cluster. The methods used with and the utility of the present invention can be shown 
by the following non-limiting examples and accompanying figure. 

15 

EXAMPLES 

GENERAL METHODS : 
METHODS : 

General methods in molecular biology: Standard molecular biology 
2 0 techniques known in the art and not specifically described are generally followed as in 
Sambrook et aL, Molecular Cloning: A Laboratory Manual, Cold Springs Harbor 
Laboratory, New York (1989, 1992), and in Ausubel et aL, Current Protocols in 
Molecular Biology, John Wiley and Sons, Baltimore, Maryland (1989). Polymerase 
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chain reaction (PGR) is carried out generally as in PCR Protocols: A Guide To 
Methods And Applications, Academic Press, San Diego, CA (1990). Reactions and 
manipulations involving other nucleic acid techniques, unless stated otherwise, are 
performed as generally described in Sambrook et al., 1989, Molecular Cloning: A 
5 Laboratory Manual^ Cold Spring Harbor Laboratory Press, and methodology as set 
forth in United States patents 4,666,828; 4,683,202; 4,801,531; 5,192,659 and 
5,272,057 and incorporated herein by reference. In-situ (In-cell) PCR in combination 
with Flow Cytometry can be used for detection of cells containing specific DNA and 
mRNA sequences (Testoni et al, 1996, Blood 87:3822.) 

10 

Recombinant Protein Purification 

Marshak et al, "Strategies for Protein Purification and Characterization. A 
laboratory course manual/ 1 CSHL Press, 1996. 

15 Example Is 

Analysis of the TA gene cluster by chromosomal restriction map. 

Chromosomal DNA of several transposition mutants (ER-2514, ER-1037, ER- 
1030, ER-1311, ER-7513, ER-3708, ER-4639 and ER-6199; Varon et al, 1992) was 
extracted, digested with restriction enzymes that cut within the transposon, and 

20 analyzed by Southern hybridization with six different probes (originating from TnV 
and Tn5/ac). We used probes designed to hybridize either to the entire transposon, or 
to its 5' or 3 ? ends. A chromosomal restriction map of the whole gene cluster was 
constructed on the basis of these results (Figure 1). The data refined the transduction 
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map (Varon et al 3 1992) and farther indicated that all the genes in the cluster are 
transcribed in the same direction (see Figure 1). 

Preparation of TA-specific probes 
5 DNA fiom the TnF mutant BR-4639, ER13 1 1 and ER-6 1 99 was digested with 

Kpnl (does not restrict TnF), self-ligated and transformed into E. coli XLl-Blue MR 
using the transposon-derived kanamycin resistance for selection. Tranformant clones 
pPYT4639, pPYT1311/p5 and pPYT6199 carried a 1.5 kb, 2.3 kb and a 11.2 kb 
fragment, respectively (see Figure 1). 

10 

Cloning of a M xanthus DNA region encoding genes involved in TA 
biosynthesis. 

A library of M. xanthus ER-15 was constructed in the cosmid vector 
SUPERCGS-1 and screened using specific TA probes obtained from transposition 
15 mutants (ER-4639, ER-1311 and ER-6 199, see map) that contain a TnV transposon. 
Seventy four recombinant cosmids that carried genes required for TA production were 
identified through colony hybridization. The cosmids, pPYCC64 and pPYCC44, 
which hybridized to these probes were further characterized through restriction 
analysis (see Figure 1) and sub cloned for sequencing. 

20 

Throughout this application, various publications, including United States 
patents, are referenced by author and year and patents by number. Full citations for 
the publications are listed below. The disclosures of these publications and patents in 
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their entireties are hereby incorporated by reference into this application in order to 
more fully describe the state of the art to which this invention pertains. 

The invention has been described in an illustrative manner, and it is to be 
5 understood that the terminology which has been used is intended to be in the nature of 
words of description rather than of limitation. 

Obviously, many modifications and variations of the present invention are 
possible in light of the above teachings. It is, therefore, to be understood that within 
10 the scope of the appended claims, the invention may be practiced otherwise than as 
specifically described. 
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SEQ LISTING PAGE(s) 
REGION 1: 

Tal - Fe^tidggYizLtjietage uiait-FKS saodnie. 

FRAGMENT gi2efea):2392 

VDPA3^TRAWEGLLERTO 

GPLA^BX-YSRSEHEETVXXLCFHHL^ 

LAlGGDEGRRHLDYWHVLATPVP^ 

LPTVLLGLYYAI^BMira^ 

VIDGLEHAHYPFPRVVKDIJ^ 

PLELEWEGAXGLTLHFKYD^^ 

WNATATPFLEDLGVHELFQRQaIU^^ 

DRSAELVAAMLGVLSAGAAYVPIX>PVKPED^ 

DV&AGATSAPAGTSPNGLaYVIYTSGSTGS^KGY^ 

IAALELLLPLCAGAQVIIASAETVRB^ 

ALPESLSO^HFVRTASDVWN^^ 

LWlAGAGVACGYl-NRPALTAERFVS>5pp 

GEIEaQLAGHFSVKNCAVVAKELNGTSQ 

LSGNGKVDRGQLMARPVVTRRKTSAVHAR^^ 

LVEEMNR&FDTSO.AVTO 

PGAADPWRPWK^REGRDSW 

FMDPQFRlJJ^MRAWKAVroAA 

QaGSSPTMVSYKLGLKGPSLFVTITO^ 

NFSSAGRVXAFDAAADGMXAGEGVAVLV^^ 

GQAEVIRM,FDRTGIDFASXGYVEAHGTGTL^^ 

AGLAGLIKTALSLRQGEVPPTlxHVTQ 

^E]TYPRD5RFRER8QRSNAVRaVAPF8AR^ 
ERIVfVVTASTRDBLVEGLRRGlATVGGAHVGTVVDTSP 

^^^^^ 

GAEGASALASll^DRRXEVVRTSSPSARI^A^^ 
^AS^ 

SSLAAFGNAGQSDYAAANGFMDG^^ 

YTRDSLAQRFGSAVGVFAGriXTGYELYGAELEGRDASV^YTSFASVA]^ 
SASLTAVHMACEALQRGACVMA1AGGVNLYVHPSSYV 
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BNA sequence aixcieotides 1-71 78. 



GTCGACCCGOCGAGGCTGACCCGGGCCTGGGAAGGACTGCTCGAACGGTATCCGCTGCTCGCTGGC 

GCGATTCGCGTCGAAGGCACGGAGCCGGTCATCGTCCXCAGTGGGCAGGTCTCCGCCGAGGTCCAC 

GAGGTTCCATCGGTCTCCGATTCAGCACTGGTGGCGACCCTGCGCGCCTCCGCGAAGGTGCCATTCG 

ATCTCGCCTGTGGACCGCTCGCTCGGCTGCACCTGTACTCGCGGTCGGAGCACGAGCATGTCCTGCT 

GCTGTGCTTCCACCACCTGGTGCTCGATGGGGCATCCGTGGCGCCCTTGCTCGACGCCCTCCGGGAG 

CGTTACGCCGGGACCGAGGCGAAGGCGGGGCTGCTCGAGGTTCCGATCGTCGCTCCTTACCGCGCC 

GCCGTGGAGTGGGAGCAGCTCGCCATTGGAGGCGATGAGGGACGGCGCCACCTCGACTACTGGCGG 

CACGTGTTGGCCACGCCCGTTCCTCCGCCGTTGAATCTTCCAACGGACCGGCCTCGCTCCGCCACGG 

GGCTGGACTCGGAGGGAGCAACGCACTCGCAGAGGGTGCCCACCGAGCAAGCATTGCGACTGCGCG 

AGTTCGCTCGGGCACAGCAAGTGAGCCTGCCGACCGTCCTGCTCGGGCTCTACTACGCCTTGCTTCA 

TCGGCACACGCGCCAGGACGACGTGGTGGTCGGCATCCCCACCATGGGGCGGCCCCGGGCGGAACT 

GGCGACGGCGATTGGGTACTTCGTCAACGTGATGGCCGTGCGCGCGCGGGGCCTGGGGCAGCACTC 

GTTCGGCTCGCTGCTGCGCCACCTCCACGACTCGGTCATCGATGGCCTGGAGCATGCCCACTATCCC 

TTCCCGCGAGTGGTGAAGGACCTCCGGCTGTCGAATGGGCCCGAGGAGGCGCCTGGCTTCCAGACG 

atgttgaccttccagagcctgcaactgacgagcgctccgccaaggccggagcccaggtcgggcggg 

TTGCCGGAGCTTGAGCCGCTCGACTGCGTCCATCAGGAAGGCGCCTACCCGCTGGAGCTTGAAGTGG 
TGGAGGGCGCGftAGGGCCTCACGCTGCATTTCAAGTACGACGCGCGGCTGTACGAGGCGGACACGG 
TCGAACGGATGGCGCGTCAGTTGTTGCGCGCCGCGGACCAGGTCGCGGATGGGGTGGAGTCTCCGC 

tgagcgcactgtcgtggctcgacgacgaagagcgccgcacgcttctccgcgactggaatgccacgg 

CCACGCCGTTCCTCGAGGACCTGGGCGTTCACGAGCTCTTCCAGCGGCAGGCCCGGGAGACCCCAG 

acgccatggctgtgagctacgaggggcactcgctcagctatcaggcgctggatacgcggagccgcg 

AGATTGCGGCGCACCTGAAGAGCTTCGGCGTCAAGCCTGGGGCGCTCGTGGGCATCTACCTGGaCC 

ggtccgcggagcxggtggcggcgatgctgggtgtgctgtccgctggcgcggcctacgtacccctgg 

acccggtgcaccccgaggaccggctgcggtacatgctggaggacagtggcgtggtggtcgtgctgg 

cccgtcaggcctcgcgggacaaggtcgccgccattgccggagcctcctgcaaggtgtgcgtgctgg 

aggacgtcaaggctggagccacgtccgcgccggcgggaacctcaccgaacggacttgcctacgtca 

tctacacgtccgggagcacgggccggcccaagggcgtgatgattccccatcgcggggtggtcaact 

tcctcctgtgcatgcgcaggacgctgggcctgaagcgcacggattcgctgttggcggtcacgacgta 

ctgcttcgacatcgcggcgctcgagctcctgcttccgctgtgtgggggggcgcaggtcatcatcgcg 

tgggcggagacggttcgggatgcgcaggcgttgaagcgggcgctgcgcacccatcggcccacgttg 

atgcaggcgacgcccgggacctggacactgttgttccagtctggctgggagaacgccgagcgggtt 

cgaatcctctgcggtggagaagcgctgccggagtcgctcaaggcccacttcgttcgcaccgcgagc 

gacgtgtggaaCatgttcgggcccaccgagacgaccatctggtcgacgatggcgaaggtctcggcc 

tcgcgtccggtcaccattggaaagccgatcgacaacacgcaggtctacgtgctggacgaccggatg 

cagccggtgcccatcggtgtgccgggggagctgtggattgcgggcgcgggcgtggcctgcggttac 

ctcaaccggccggcgctgaccgccgagcgcttcgtttcca.\tccgttcacgccgggcacgacgctct 

accggacgggggacctggcgcgctggcgcgctgacggtgaggttgagtacctggggcggctcgacc 

accaggtgaaggtgcgcggcttccgcatcgagatgggggagattgaagcgcagttggccgggcatc 

ccagcgtgaagaactgtgccgtggtggccaaggagctgaacggcacctcgcagctcgtcgcctact 

gtcagcccgcgggaacgagcttcgatgaggaagccatccgtgcacacctgcggaagttcctccccg 

actacatggtccccgcgcacgtcxtcgcggtggatgggattccggtgtcgggcaatggcaaggtgga 

ccggggccagctgatggccaggccggtggtcacccggcggaagacatccgcggtccatgcccgttc 

gcctgttgaggccaccctcgtcgagctgtgga.^gaacgtgctccaggtcaacgaggtgggtgtcga 
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GGATCGCTTCnCGAAGTGGC-GGGGGACTCCGTGCTGGCCGCCGTGCTGGTGGAGGAGATGAACCG 
GCGCTTCGACACGCGGCTCGCCGTCACCGACCTGTTCAAGTACGTCAATATTCGCGACATGGCGCGC 

cacatggagggcgcgacggcgcaagcccgtactggggccaccgagccggctcgcgaggacaccgc^ 

TCGGAGCGTGACTACGAGGGCAGCCTGGCCGTCATCGGCATCTCCTGTCAGTTGCCCGGAGCCGCGG 

acccctggcgcttctggaagaacctgcgagagggcagggacagcgtggtggcgtaccgccatgagg 

aactgcgcgagctgggcgtgcccgaggaggtcctccgcgattcccgttacgtcgcggtccggtcgtc 

catcgaagacaaggagtgcttcgacccgcatttcitcggtctgacggcgcgggacgcgtccttcatg 

gacccgcagttccgactgttgctgatgcacgcctggaaggcagtggaagacgcggcgacgacgcct 

gagcgcctgggaccgtgcggcgtcttcatgacggccagcaacagcttctatcaccagggctcgccgc 

aatttcctgcggacgggcagccggtcctccgcaccgccgaagaatacgtgctgtgggtgctggccca 

ggcaggctccatcccgacgatggtttcstacaagctcggcttgaaggggccgagcctgttcgtccac 

accaactgctcgtcatccctgtccgcgctgtacgtggctcagcaggccatcgcagcgggagactgcc 

agacggcgctggtgggggccgccacggtcttcccttcggcgaacttgggttatctgcaccagcggg 

ggctcaacttctccagcgcggggcgggtcaaggccttcgacgccgcggcggacggcatgattgccg 

gtgaaggtgtcgccgtgctggtggtgaaggacgccgcagcggcggtgcgcgatggcgacccaatct 

actgcctcgtgcggaaggtggggatcaacaacgacggccaggacaaggtgggtttatacgccccga 

gcgccaccgggcaggcggaggtcatccggcgtctgttcgaccggaccggcatcgaccctgcatcga 

ttggctacgtcgaggcccatggcaccggaaccttgctgggtgaccctgtcgaggtctccgcgctgag 

cgaagccttccggacctxcaccgaccggcgcgggtactgccggctgggctcggtgaagtcgaacct 

gggccaxctggacacagtggctggactggctgggctcatcaagacggcgctgagcctgcggcaggg 

CGAAGTTCCTCCGACGCTCCATGTGACCCAGGTGAATCCGAAGCTCGAGCTGACGGATTCGCCGTTC 

gtcatcgccgaccgtttggcgcggtggccgtccctgccgggaccgaggcgggcggccgtgagtgcg 

ttcggccttggcgggacgaatacccacgccattctcgaacactacccgcgcgactcccgcccacggg 

agaggagccagcggtcgaacgcagtccgtgcggtggctccattctcggcgcgcaccctggaggcgt 

tgaaggacaacctccgcgcgctgctcgacttcctggaggacccggcgtccgcggaggtggcgctcg 

cggacatcacctacacgttgcaggtcggccgggtcgcgatgcctgagcggatggtggtgactgcgt 

cgacgcgcgacgaattggtggagggactgcggcgaggcatcgcgacggtgggcggtgcccacgtgg 

gaacggtggtcgatacgtcacccagcgtggatgccgatgctcgggcagttgcggaggcgtgggcga 

cgggcgactcgattgactgggattcgctgcacggtgacgtgaagcccgcccgtgtcagcctgcccac 

gtatcagttcgcgaaggagcgctacgggttgtcgcccgcgcactccgtggcgaattcctccaagacg 

catcctgacgcgggtgtcccgctcttcgttccgacctggcagccgtggtctgagggcgggtcaaatg 

cctcgttggcgctccggcacctggtggtgttgtgcgagcctcttgatgcgctgggggctgaaggtgc 

ctccgcgctggcgagcacgctcgcggacaggcgcatcgaagtggtcaggacgtccagcccaagtgc 

gcggctggacgcgcggttcatggcgcatgcctcggcggtcttcgaacgcgtcaaggcgctgctgtcg 

gagcgtctgaccgctcctgtgacattgcaggtgctggtgccagaggagcgggatgcgctggcactg 

agtggcctggggagcctgctgcgttcggtgtcgcaggagaatccgttggtccgggggcagctcatc 

cgcgtccagggaagcgtctccgca-tcggcgctggtggacgttctggtgaagtccgcgcgcgccggt 

gacgtcaccgattggcggtaccacgcgggccagctrrctcgctgtgagtggcgcgaggcacgtgtcg 

ccaagggggacgcatcccgcttctggcgcgaagacggcgtctatgtgatttcaggaggaaccggcg 

ccctggcccggctgttcgtcgccgaaatcgggaagcgcgcgacgcgggccaccgtcattctggttgc 

ccgcgcatcctcggcggaggcggtggacggtgggaacgggctgcgcgtgcggcaccttcccgtgga 

tgtcacccaaccgaacgacgtgaacgcctttgtcgctacggtgctgcgcgaacacgggcgcatcgac 

ggtgtcatccatgcggcgggcatccgccgtgacaactacctgctcaacaagccggtggcggaaatg 

caggcggtgctcgcgcccaaggtggtggggctcgtcaacctggaccacgccacccgcgagctgccc 

ctggatttcttcgtcacgttctcgtccctggccgcgtttggaaacgccggtcagtcggactacgcgg 

cggccaatggcttcatggacggattcgcggagtcccgagcggcgctcgtgaacgccggacagcggc 

agggccggacggtgtccatccgttggccgctctgggagaacggcgggatgcagctcgactcacgga 

gccgtgaggtcttgatgcagcggaccgggatggccgcgctgggagacgaagcgggactgggggcgt 

tctaccgggcgctggaactgggctcccctggtgtcgcggtgtggacgggggaggcccagaggittc 

gtgaactctccgtgagtgtttcgcccgcaccgcctccgcatcaggtggcgttggacgccgtggtgtc 

catcaccgagaaggtcgagacgaagctgaaggcgctcttcagcgaggtcacgcgatacgaagagcg 

ccgcatcgatgcccgccagccgatggagcgctatggcatcgactccatcatcatcacgcagatga-^c 

caagccctcgaagggccgtacaacgccctctcgaagacgctgttcttcgaataccggacgctcgcgg 

aagtcagcgggtatctggccgagcaccgcgcggaagagagcgcgaagtgggtggcggcacctgga 

gagaattcgtcttccgtcatccaggaggccaggccgccacgtgcggatgcgacgcaccgggcgcct 

cgcgccgacgaocccatcgccgtcattggcatgagcggccgttatcccggggcggagaacctg^cg 

gagttctgggagcgcctgagccgcggtgacgactgcatcaccgagattccgccagagcgctggtcg 

ttggacgggttcttctacccggacaagaagcacgccgccgcgcgggggatgagctacagcaagtgg 

GGCGGCTTCCTCGGCGGCrrCGCTGACTTCGACCCGCTGTTClTCAACATCTCGCCGCGTGAGGCGA 
CGAGCATGGACCCGCAGGAGCGCTTGTTCCTGCAGAGCTGCTGGGAGGTCCTGGAGGACGCGGGG-r 

ACACCCGGGACAGCCTGGCCCAGCGCTTTGGCAGCGCGGTGGGCGTTTTCGCGGGAATCACGAAGA 
CGGGCTACGAACTCTACGGCGCGGAGCTGGAAGGACGAGATGCCTCGGTCCGGCCCTATACGTCGT 
TTGCGTCTGTTGCCAACCG CGTCTCGTATCTG CTCGA CCTGAAGGGGCCGAGC ATGCCCGTGGAC AC 

CATGTGCTCGGCCTCGCTGACAGCCGTCCACATGGCTTGCGAGGCGCTGCAACGAGGCGCCTGCGTC 
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ATGGCCATCGCGGGTGQAGTCSAArCTCTACC-TCCACCCG-TCGAGCTACGTCAGCCTGTCCGGGCAGC 
AGATGCTGTCGAC 



REGION 2 

TaSUL - Surface layer protein 

From nocteotido 2955 to 60U iwfo*) ! 7SS : 

vertltvxvadseeqlvfstarstgaa j 

I 

I 

TaE2 - twro cam^osaeait $vsts^^ ^™ nne regulator 
'Frnm nucleotide 3116 to 47fl2*ri gff ! f aa ) s 529 * 

Sgdgalygvi^ 

YR^^I^LENCffiSA^CEGElLEEaPLPDVDRAALPPPAAAQGVNAPTAPAPLDAGLLPLAEVER 
SHILRVLDAVKGmTAAARVLAlGBHrLAEKLKEYGLGDEP 

TaR3 >■ two component system, fchiase semsor. 

From aiacleotida 5595 to 4720. M aa): 292 

M^SOAEAPWST^TVffiVRFHGVR^ 

MD^ALHARFVTJVGPFRV^IDVPKPQ^ 
CLDAQYIPDEYEGRKGVAKKGWGH3T^^ 

LFPVCSPASEGQRLVLGRAA 

TaA - NUS-G liks trm «»»«"«*»" aatitermiaatior. 
From nucleotide 6290 *n OTre. gjg^aa^: 16§ 

QLaFIVSVGLLQRSVSVEVSAEQL^TTD 
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TaB - acvi earner protein (AC¥\ 

From xraeleotide 6§7fl to 7106, sise(aal: 79 

MDKIUIFDIYTSSVREVVPEI^ 
DVLHAKJL 



TaC - beta*-ketoacvi fACTI gyatliase HI {KAS ITT. FahH) 
From aiscleotide 7119 to g37§„ slaefaa^: 420 

mgfvgieam&aycgiarldvlqlath^ 
ekdsiellvactessfpfgkamsty^ 

vvasdlskfsiaeggdastedwsfaepssgagavamlvsdtpryfrvdvga 

sea gdadls1xs vld ccenafreytbs.vp aany aesfgvx,afhit[fggmvkg ahrtmlvockf sgkkr.gd 

ieadfqrrvapgltycqrvgnm^ 

RQKAXGLGEaLGR^QQLSMPDYDALLXGNG 
PYQWX3 



TaB - miQmhY%M® associated prateim. 
Fyom Bsacleotide 8404 to $m®(&a.lz 325 

MSSVATAVttTAWDSAVSM^ 

yyf*urgtpal;efgal^ 
yaetfnrwhsot^ 

PLRYAVDVIkDVNGVGLIYFAS^ 

d vqhwervgggeblfnvicmrega q GBX3IA v atvk; ^KFD AaSEG grrg 



TaE - ag^i carrier Brotcia (ACFL 
From nucleotide 93§6 to 9364. aizefaaft: 

MTD£QI&GYVHQ8WRVX.P&\^^ 
VAFLAGEVAGG 



T&F - tseta-ketoatcyi fACPl sysatfaagis IH (KAS IXL F&bSBQ 
From nucleotide 9757 to 10g?g, glaefaa): 374 

MMQERGVALITODPVTN^ 
EVKQACYAATGALQLAUFi^ 
MDLGAFGNYSYDVFDTAOTSPEIDI^ 
GLVXAGHRKM&d2^^ 

SYGSGCSSEFFSGVXGFESVSAI^GLDXGGHLRGR^^ 
VTKTASBPBMLALR&VVDYHRQYEWV 



TaG - signal BeBtidase H flLSP 

ffrora nucleotide 10909 to 11421. gjgsfaak 171 
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MNTPSL1NWFARLGYLLAW 
FGEEWSMFFPYVYGAIC^ 

D R5 YW SFNIADT A VVV G.4ALMIL^3 F3Ji?H.C CL5? G 
TaH - cytochrome P4S0 livciro^riage feP t 450') 
From nucleotide 11473 to 12897, sizefaa): 475 

MGTSEPYEPDHALSKPPPVaPVGAQaLPRGPaM^ 

HQTSDPALmVTFXGDPDLFLGGjg^^ 

IU3rWAALDRWPVGKPFAV^ 

AEGFARAVGKL^SLDWASL^ID^^ 

MTLLMAGHETSATIAAWCVYHLCROT 

VFSLVaRVLKEPQTIGGTTYPA^^^ 

CIGTSFAYYEMKIFVSETVRRMRFD 



Tal ■ maloiryl CaA FACFT tramacylase (MCT- FabB) 
From inigleotide 1293& to 13891 ? sigefsraV. 31S 

MVX3SVS&QAR&&VFLFSGQGTQ 

PRGGMLAVLIffiLEACRGFTSVASUDG 

SHLBPLKEEYIiSRVJL&^ 

DVGPAASFAOTKSOLARDST^^ 



TaJ - sm^IonTl €oA fACPl transaeylwe (MCX, FabBl 

Ftom nacaeotgde .35909 to 14S9& gjgefoate 330 

MTEAFAPIL4PAQVFPPPSSPWALHT^^ 

LGYSIKI^CLEDPGKELAQTQF^ 

TGIJ^VKMUj]^ 

SPVQWTaTV&YLLEQGVBDFB^ 



From nucleotide 14963 to 16213. gizefaate 417 
MSTSPVQELWSCTGVTSAI^ 

gytpqlw^atfsgqaalvtvheawnaa^ 
aaygstfmdtdlvglctqqfamgmsftv^ 

CQGLBAMGaMGTD&FaB^ERACR^^ 
DASRG^LSSIEKESQVIGAALRB^ 
HGL3SAGAVGLlAlXVQIJ?QG3U^K?SL^ 
MATES 
TaL - esoyl CoA favdratase. 

From imgieatidfe 16224 to 17009* sizefaafts 262 

MQA^SPPHE^YQTLRVBFE^ 
G AJDEXAIHDRVDDGRSE^ 
FSLSELLFGI^ACVMFFL^^ 
KPaVTQYK&YaSELGGQLLAa&^ 
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TaM - snovi CoA kvdratase. 



From imcgenfcide 17Q0O to 37767> sizefaa): 256 

MG1>ITBGTPMA?VVTLHEV^ 

Vf4I,GGT&AGLLSICDGIG3Kn^T^^ 

FMRYGFTPGMGATYIYPKR^ 



TaN - Q-met&vltransferage ffragmentX 



From miacleotide 17782 to 19G53. sisgefaa^: 423 

^DVYAVTAAAAAHRSFPR^ 

pijk4IALEQRGGLKBARM-S^^ 

^ALLFAIVGSYRPMLASMPQLLFGD^ 

AENLSAQP&YIADMGCGDGTIXJ^ 

ALBAJDVARPDBXXED^ 

EVFHSLVEHLB 



DNA sequence 1-19053 

GTCGACGTTGACGTCGCCCGGTGGCGTGCCGTGTGTCTTCTTCGACGCGGAGGTGCGCGAGGTGGCG 

GCGGACGGCCGGCGCGGGCCGCTGTTGTCGCGTGAGCGCGCGTATGCGCCGGTACTGGCGCTGCGT 

GGGCAGCGCCTCCATGCTTCGGTGTCCTTTTCGCCCGCGTCGCTGATGGGTCCGGTGGAGGTGGGCC 

GGTGCAAGGCCCTGCCAGGCACGGTGCCCGCGTCCTGGTATCAGACGGCGCACCCGGAGGCCCTGT 

CCTGGGAGCGCGTGGGCGCGGTGGGGGAATCCTGCCTCGTGGTGGGTGAACTCCGGAGGGGCCCTG 

TCGAGGGCAGCTACGCCCTGGTCGGTCGGGAGGGCGGCCCCGCGATGTTGGTGCTGGGACCCCAGG 

CTCCGGCCACCTGTGGGACGCTGGCGCGCCGGGCCTGGCGGCACTTCGGGGCGGCCGGGGTGCTGT 

CCATGGCCGCGGCCGTCGTCCTGTCAGGGGCGCTGTGAGACGCGCGGCGGGGGCCGTACCGCCGCG 

CCAGAAACGTGATGCGCCGCCAGGCCTCGCGGTCCGGGCACTGACGCCCGGGCCGCTCGGGACTCG 

CTCAGGCGGCTCCGGTGCTTCGCGCGGTGGAGAACACGAGCTGTTCCTCGCTGTCCGCCACCCGCAC 

GGTGAGGGTCCGCTCCACGCCGGCGAGGCCCAGCGGCGTGGACTGCGCCAGGTCCGAGAGCAGGGA 

GCGCGCAGCGCGCAGGTGGAAGCCGGTGGTCCACATGCCCTCCAGCTCGCCGAACACGGTGCGCAG 

CTCGTCCGGGGCCTTGCGTTCGTCGATGGCGCGGCGCAGGCGCACCATGCCGCTCACGAAGTTCCTG 

CACCGCATGAGCGTGTTGGGGTTCTTGGAGATGACCTCCGCGAAGCTGAAGTTGCCGCCACCCGGGC 

gccactcgctttcgatgagctgcaggtccttgccaaggtcgcgcaccgtggggtccttctggaagta 

CCACTTGGCGATCTGCGCGGCGCGGCTGGGTGACAAGTCATTCAGCATGAGGCTGCCTTGCTCCTGC 

ACCTCGCGGAGGTAGGCCTCCCAGATGGGGGeGTAGACCGCGCGCCGGAACTCGGCGGAGGCGCGG 

GGAAGCTGCTCGCTCCAGGGCAGCGCGCGGGCCAGGGCGCGTGAGAAGCGGGACAGCTCGAGCGTC 

TGGATGCGGGGCACCAGGGCGCGGAACGAGTCATCCGCCATCTTCAGCTCGAGCCGCGTTTCGATG 

GGGTGCTTGCCCATGTCCTCCTGCATGGTGCTGATGACGGTGGCCGCGTCCTTCGCGTCCAGCACGC 

CCCAGACGACGGCGTCATCCACCGCCTGCiTGCAGGTCCTTGCGCGACAGCTTCTTCTGCCGTCCCCA 

CAGCATCAGGTGCAGGCCGTAGCCGAAGTCGGAGGCCACGGGGGTGGGAGAGAAGCGCGTCATGTC 

CGCCTTCAGGTCCACCACCCACTGGCCGCGGGTGCCCAGCAGGTTGTCCTCGTAGCCCCGGCGTCCG 

AGGAACGCCATGCGCCGGGCGCCCTGGAAGTTCTCCTGCGTCACCCAGGACTGCTTGCTGACGTTCT 

TCGCCTTGAGCAGCTCGAACGAGCCCAGCCCCAGTGAGAAGCCGATGGCCTGCTGGCGCGTGAGCG 

TGGTGGCGTGCAGGTAGTTGCGCAGCTCGAACTCGCTCTGCTGGGCGGGGAGGCTCTTCATCCACTT 

CAGCAGCTCCACCAGGTTGCCCTTGAGCAGGGACTCGTGGAAGCGCATCGCGGTGACGTCCTCCACG 

A,CGACCTCCAGCAGCGTGGAGGTCTCCGACAGGCGCAGGTATTCGTACTCGAAGCCCTCGGCGACCT 

GCGTGCGGACGGCGTTCTCCAGCGACTCTGCGACGCGGGCCTTGAAGTCGGCCCAGGCCTGCGGAA 

GGTTGGCCGGGTCCGCAAGCTGCGGGTCGATGCCAAGGCGCTCCAGCACCAGGCCCAGCACCTTCTT 

CTGATTGTCGTCC^GCTrCGCCTTGCTGGCCTTGTCCACCAGGCCGTCCATGATTTCCACCGCGGTGG 

TGCCCTTCTTGACGAGGTCGCGAAGGACGGGCCCCAGCAGCGCTTCCAGCAACTGGCCCAGTTGGG- 

CCTTCACCGTCGCCGGGTCCAGCAGCTCCACGGTGATGCCCAGGCCGGCGGAGAGCGCCTGCTCATG 

GGAC^TCACCTTGCGCACGGCGACCTGGATGCGGCCGGCACGGGGACGGGAGAAGCTGAGCTGGTA 

GTCGTCGGTGAGGGACACCCGGGCGGACAGGCCCGCC1TGGCGCTGGTCTTCAACGCGAGCAGCTC 
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gttgccgcgcac-gaagcccaegcggttgaggttggtggggaagatgtccccccagttgagctccag 

ccgtgtgtggagcgcgccgcqgacateccacaccaccgcgtcccccgtggtcagcttggccaggtcg 

gtggcctgcatcagccgcagctcggacaggtcggagcgcacggcctcacgcatgttctggcccagc 

ggatgcgcgcggtagtcgctgagcgtgacggacagctccgtctggttctcggaggcgaggaaggac 

aggctggcgctcacggcggccttcacgcgcgcggacacctggtagcgcatccacccgatgtcactgc 

ccagcagcagttggggccggggccctggctcggcgctctccgggctcggctcgccgatgatgccgtt 

ttcgtccacgtcgcccagcgagttgaagttccggatggacgcttcgccggtggcttcgaaggagag 

ggtgaagccgtgcatggcgagcttgggcggaaggattttctcttccgtgaaggtcccctgggccagc 

ggcaccttgatgtcctggtcctgo^gcttcacgtcgggcaccttgcccgccacgacgtcgggaagct 

tctccagcagcttgttgaccactttcatgcgcgtccccctgggctgaagcctcctgcacgtgggccg 

gaggtctcttcgtcgtacgccgttgcccagctcggaacaaggcggataccagaaaagaccggtggt 

cagcggacagatgccctggagggtggggtgggagccgcccccgcgcggtgcgtcagggctcgtcgc 

ccaatccgtactccttgagtttccgcgcgagcgtgttgcggccaatcgccagcacgcgggccgcggc 

ggtgcggttgcccttcacggcgtccagcacgcgcaggatgtggcggcgttcgacctccgccagtggc 

agcaggcccgcatccaggggcgcaggcgcagtcggcgcgttgacaccctgagcggctgcgggaggc 

ggcagggcggcccggtccacatcgggcaggggcaggtgctcttcgagaatctccccttcacagagc 

accacggcgctctcgatacagttctccagctcccgcacgtttccgggccagcggtagcgcttgaggc 

gctccaccgcggcggcgctgaggcggggcggcgtcagccggtgcctccgggcgacggcggcgacga 

agtggcgggcgagccgctcgatgtcctccgcgccgcgctcccgcagcggcggcagcaccacctcga 

ccaccttgatgcggtagtagaggtcctcgcggaagcggccctcggccaccatgcgggccaggtccc 

gatgggtggccgcgacgatgcgcacgtccaccttcacggcctgggtgcctcccacgcgctcgaactc 

gcgatcctggatgacccgcagcaacttgccctgcaccggcaggggcagctcgccaatctcgtcgatg 

aacacggtgccgccgctggccgcttcgaacttgccgggcacgcggtggtccgcgccggtgaaggcg 

ccgcgttcgtggccgaagagctcgttctcgatgagcgtggcgggcagcgccgcgcagtccaccttga 

tgaagggctggtccctgcggggaccattcacgtggacggcacgggcgaacagctccttgccgctgc 

cactctcgccgcgcagcagcaccgtcgcatcggtgggcgcggccttgcgcaccagtcggtagatgg 

cctggagctgcggggactcgccgatgatgcggttgaagaagtagcccaccggtacctggggctgct 

ccttcgcgcgctggagctcttgatagaggctggtgctctggagggcggtgctcacctgcgaggcgat 

ggcggtgagccgctgcgtgtcctcgtcggtgaagcggtcctcgccgcggcggttgaggagctggao, 

cacgccgtagagggcgccgtccccgtcgcgcagtggcacggcgagcaggctggtggtgcggtagcc 

cgtcatccggtcgatgtccgcgaagaagcgctgctcgccgcgcgggtgcggcacgttgatggcgtg 

ccccgccttggcgacggtgccggcgacgccctggcccagcttgacgcgaatctgggacacctcgggc 

aggtgcgcggcgcggctgaacagctcgcggcgggccgggtccagcagccagagcgtgccgcggtcc 

gcttggagggtgatggcgaxgcggtccatcagcgtctggaggaacgcgtcgaggtccacctccctg 

ccgacgagtcctccgaaggggaggaggagctgggagacgtccgagggggcttggggcatggcggg 

caacggcggcaggacgaaggcggaggccgcaccataacatccagagggcatgggactgccccctct 

caggccgcgcggcccagcaccagccgctggccttcgcgtgcgggctcgcacacggggaagagggcg 

cgggcctgctccgccatgtcctcgagcatgtcgtcgccgtgcgccgggtcatggtggaacaggcaca 

gccggcgcgcccccaccagcccggccacgcccgcggcatccaxcatggtggagtggccccagccctt 

cttcgccacgcccttgcggccctcgtattcgtccggcgtgtactgcgcatccaggcacaggacgtcc 

gcgccctcgaagaggcggcccacctccggcgcgagctcctgcacccgcacctccacgtccgtggcgt 

agacgaacgaatggccatccgcctccaggciggtaggccaggcacccctgcgggtgcggcacgtcga 

tgggcgtgacgcggaaggggcccacctccacgggtcgggcatgcaacgccgagcggaagtccatcc 

gcgagcgcatggtgctcagcggcaccggaaaatgaagcggctgcatctgcgcggccaactcggact 

ggagcgcctgggccccattcgcgcccggaccgtagagcgtcagctcggacgtgggcagccaggccg 

gcgtgaagaaggggaagccctgcacgtggtcccaatgcagatgcgagaagaagagcgtggcctcct 

ggggcgcgccctcgcgcatcatgatttcgcccagtgcgcggatgcccgtccccgcatccaggatgag 

gcggtggccctggctggtcacctccacgcaggccgtgttgccaccaatgcgcgagcccgagaccgcg 

atgctcccccgaacgccatgaaaccggacttccatcgtaagtctccttgaatggggggcctccgcct 

gggacgccctcatggccggagcctc^gagcacggggtgtgcc^ttcccaaatgcccggaatcagga 

gcgcgggcctcgggctcgtccaccggtgctccagaatggatcgcgctcgcctggtgcgggcgatcca 

aagcggtgcaggtcgcccgcaggacggggcggcgggcacgtcttccaacgtcccacggcagtcctg 

tcttcagatctctcccgatgcgggaaggcgtccaggaggttgcacccggcatcgagcggggctgtgt 

gtttcaagtcttgtcggagcctcggacacaaccgtctgggttctgggaatgcgccggcttccgttca 

CTCCAGAGTGATTCAATGGCTCTCGAGTGCAGGTTTAGCAATCCTCGGGCCGTAACCACGCCGTTGA 
AGGCAGTCACGCTCTCGTCACGCTTGGGGTGTTTCCAGCTTCAACGGTGTTTATCCTTCAGGGCGGT 
TTGCTTGACACGCTGCCTCATGGAAGCGTATGCAAAACAATGAAAACGGTGTCGTTGCCGAGCCTTA 
GGGCCTCCAGAACGCCATCCTCGCGGACCCAGGCAGCCGGAATTTGAGACGGGGCTGTCAGCGGTT 
TGAACGCAAGGATGCGGCGGG ggttgtgg CGGCAGCCCGACCAGAATTCGGTTGGTGTGCCAGTTA 

ttgta^gattctgagaaatagcaggctggggggaagttgcaatgcctgggccgcggtgtgctgaga 
acgattgggttgcattgctcgtccgcgtcaatcacgagaaagtggctgccgctcagttggggaaaca 

CGGCTACGAGTTCTTCCTGCCGACGTACACGCCTCCCAAGTCCTCGGGTGTGAAGGCGAAGCTTCCG 
CTCTTCCCCGGGTACCTTTTCTGTCGTIACCAGCCGCTCAATCCGTACCGCATCGTCCGGGCGCCCGG 
GGTCATCCGGCTGCTCGGAGGTGACGCGGGGCCGGAAGCCGTGCCCGCACAGGAATTGGAGGCCAT 
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ccgcccggtcgcggattccggtgtctcttccaatccctgtgactatctgcgggtggggcagcgcgtg 

cgcatcatcgaagggcccctgacaggtctggaaggaagtctggtgacgagcaagagccaactccgg 

ttcattgtctccgtggggctgctacagcgctcc-gtgtccgtggaggtgagcgccgagcaactggaac 

cgatcaccgactgattccgcggacatccccttccattccttcatcaccccgacccgcagcaaggcttc 

A GGGACC gtgagtcgttc catggacaagag a attattttcgacatcgtc acc agc A.GTGTTC GGG AG 

gtggtacccgaactcgaatcacatccgttcgagccggaggatgacctggtcggactgggcgcgaac 

tcgctcgaccgcgccgaaatcgtcaacctcacgctggagaagctggcgctcaacatcccccgggtcg 

agctga'ttgacgcgaagaccattggcgggctggtggacgtccttcacgcgaggctgtgaggcgaag 

ccatggggccggtcgggattgaagccatgaatgcctactgtggcatcgccaggttggatgtgttgca 

gctggcgacccaccgtggcctggacacctcccgcttcgcgaacctgctcatggaggagaagaccgtc 

ccgctcccctatgaggaccctgtcacctacggcgtgaatgccgcccggcccatcctggaccagttga 

ccgcggcggaacgggacagcatcgagctgctggtggcttgc^cggagtcctcgttcgacttcggca 

aggccatgagcacctacctgcaccagcacctggggctgagccgcaactgccggctcatcgagctca 

agagcgcctgctactccggggtcgccgggctgcagatggccgtcaacttcatcctgtccggcgtgtc 

gccgggggccaaggccctggtggtggcctccgacctgtcgcgcttctccatcgccgaagggggaga 

tgcctccacggaggactggtccttcgcggagccgagctcgggtgcgggcgcggtggccatgctggt 

gagcgacacgccccgggtgttccgcgtcgacgtgggggcgaacggctactacggctacgaggtgat 

ggatacctgccgcccggtggcggacagcgaagcgggagacgcggacctgtcgctcctctcgtacct 

GGACTGCTGTGAGAACGCCTTCCGGGAGTACaCCCGCCGCGTCCCCGCGGCGAaCTACGCGGAGAG 

cttcggctacctcgccttccacacgccgtttggcggcatggtgaagggcgcccaccgcacgatgatg 

cgcaagttctccggcaagaaccgcggggacatcgaagcggacttccagcggcgagtggcccccggg 

ctgacctactgccagcgcgtggggaacatcatgggcgcgacgatggcgctctcgctcctcgggacc 

atcgaccacggcgacttcgccaccgcgaagcggattggctgcttctcgtatggctcggggtgcagct 

cggagttcttcagcggcgtggtgacggaggaagggcagcagcggcagcgcgccctggggctggga 

gaagcgctggggcgccggcagcagctctccatgccggattacgacgcgctgctgaaggggaacggc 

ctggtgcgcttcgggacccgoaacgccgagctggatttcggtgtcgtcggcagcatccggccgggc 

gggtggggcaggcccttgctcttcttgtcggcgaitcgtgacttccatcgcgactaccaatggattt 

cctagcctcggggcttcgagcaaagccatgtccagcgtagcgacggccgtccccctgacggcccgtg 

acagcgcggtgagccgccggctgcgaatcacccccagcatgtgcggccagacgtccttgttcgccgg 

gcagattggcgactgggcatgggacaccgtcagccgcctgtgtggcacggacgtgctgaccgcgac 

caacgcctcaggcgcgcccacctacctggccttctattacttccgcatccggggcacgcccgcgctg 

catcccggcgcgctgcgcttcggcgacacgctggaggtcacgtcgaaggcgtacaacttcggcagc 

gaatccgtcctgacggtgcaccgcatctgcaagacggcggagggcggcgctccggaggcggatgcc 

ttcggccatgaagagctgtacgagcagccccagccaggccgcatctacgcggagaccttcaaccgg 

tggatcacgcgctcggacggcaagtcgaacgagagcctgatcaagtcctcgcccgtggggttccag 

tacgcacacgtgccgctcttgccggacgaatactcgccgcggcgggcctatggggacgcgcgggcg 

cggggcacctttcacgatgtggactccgcggagtaccggctgaccgtggaccgcttcccgctgcgct 

acgcggtggacgtcatccgggacgtcaatggggtggggctcatctacttcgcgtcgtatttctcgat 

ggtggactgggccatctggcagctggcgaggcaccagggacgcagcgagcaggccttcctgtcgcg 

cgtggtgctggaccagcaactgtgcttcctcggcaacgcggcgctggacaccaccttcgacatcgac 

gtgcagcactgggagcgggtgggcggcggggaagagctgttcaacgtgaagatgggcgagggcgc 

gcagggccgggacatcgccgtggcgacggtcaaggtgcgcttcgacgccgcttcggaaggaggccg 

ccgtgggtgagccgatgacagacgaacaaatccgcggagtcgtgcaccagtccatcgtgcgcgtcc 

tgccccgcgtgcgctccaacgagattgcgggccacttgaacctccgcgagctgggcgcggactccgt 

ggaccgggtcgagattctcacgtccatcctggacagcctgcggctgcagaagacgccactggcgaa 

gttcgccgacatccgcaacatcgacgcgctggtggcgttcctggccggtgaggtcgcgggtggctg 

agcgggttcccggcggagtcggcatcgaggccatcaacgcctacggcggcgccgcctccattccgg 

tgttggacttgttccggggccggcggctggaccccgaag-cgattctccaacctgatgatgcaggagc 

gcggcgtcgcgctgccgttcgaggaccccgtcaccaacgcggtcaatgcggcgcggcccatcctgg 

acgcgatgtcgcccgaggcccgggagcgcatcgagctcctggtcacctcgagcgagtccggcgtgg 

acttcagcaagtccatctcctcgtatgcgcacgagcacctggggctgagccgccactgccggttcct 

ggaggtgaagcaggcgtgttacgccgccaccggagcgctccagctagcgctgggctacatcgcgtc 

gggcgtgtcaccgggggccaaggccctggtgaitgccacggacgtgacgctggtggacgagagcgg 

tctgtactccgagccggcgatgggcaccggcggcgtcgccgtgctgctgggcgacgagccgcgcgt 

gatgaagatggacctgggagcgttcggcaactacagctacgacgtcttcgacaccgcgcggccctc 

gccggagattgatatcggcgacgtggaccggtcgctcttcacgtacctcgactgcctcaagcacagc 

ttcgccgcgtatggccgccgggtggacggtgtcgacttcgtgtcgacgttcgactacctggcgatgc 

AC ACGCC GTTCGCCGGA CTGGTGAAGGCCGGGCA CC(3CAAGATGaTGCGCGAGCTC ACCCCGTGCG 

ACGTGGACGAAATCGAAGCGGACTTCGGCCGGCGCqTGAAGCCGTCACTGCAGTACCCGAGTCTGG 

TCGGGAACCTGTGCTCCGGCTCCGTGTACCTGAGCCTGTGCAGCATCATCGACACCATCAAGCCCGA 

GCGGTCCGCTCGGGTGGGAATGTTCTCCTATGGGTCGGGTTGCTCGTCGGAGTTCTTCAGCGGCGTe 

ATCGGCCCGGAGTCCGTGTCCGCGCTAGGTGGGTTGGACATCGGTGGCCACCTCCGGGGGCGCCGC 

CAGCTCACGTTCGACCAATA.TGTCGAATTGCTGAAAGAGAACCTTCGCTGTCTGGTTCCAACGAAGA 

ACCGGGACGTGGACGTGGAGCGCTACCTCCCGCTGGTGAGCCGGACGGCGAGCCGCCCGCGCATGC 
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agcgoccgggccgtcagcatcacgccaacgggcagacgacggcccacctgggggcggaaatcgcct 
cgctggccgtgcccgaaggcgtcaccccacaactgtggcgctcggccacgttttcggggcaggccgc 
actggtgaccgtccacgaggcctggaacgcggcgcgcctccaggccgtccccggacaccggaitgg 
attggtggtggggggcaccaacgtggagcagcgcgacctggtgctgatgcaagacgcctatcgcga 

GCGGGTGCCCTITCTGCGGGCGGCCTACGGGTCGACCTTCATGGACACCGaCCTCGTGGGCCTCTGC 

ACGCAGCAGTTCGCCATCCACGGGATGTCCTTCACGGTGGGAGGCGCATCGGCCAGTGGCCTGCTG 

GCGGTCATCCAGGCCGCGGAGGCGGTGCTCTCAAGAAGGGTGGACGTTTGCATCGCCGTGGGGGCG 

ctgatggacgtctcctactgggaatgccagggcctgcgggccatgggcgcgatgggcaccgaccgg 

ttcgcgcgggagccggagcgtgcctgccggcccttcgaccgggagagtgatggcttcatctttggag 

aggcgtgcggcgccgtggtggttgagtctgcggagcacgctcggcgacgcggggtgactcctcgcg 

gcatcctgtcgggctgggccatgcagttggacgcgagccgcggcccgttgtcgtccatcgaagggg 

agtcgcaggtgattggggctgcgctgcggcacgcggacctcgcgccggagcgggtggactacgtga 

atcctcacggcagcggttcgcgtcagggggatgccatcgagctgggggccttgaaggcgtgcggcc 

tgacgcacgcccgggtcaacaccacgaagtccatcaccgggcatggcctgtcctcggcgggtgccgt 

ggggctcatcgccacgctggtccagttggagcagggccggctgcacccgtccttgaacctggtggac 

ccgattgattcatcgttccgctgggtgggggccaccgcggaggcccagtccctccagaacgcgctgg 

tgctcgcctacggcttcggcggcatcaacaccgctgtcgccgtgcgccggagcgccacggagagct 

gacacgcccatgcaagccgcttcccctccgcaccgcgactaccagacgctccgggtccgcttcgagg 

cgcagacctgttttctccagctccaccggccggatgcggacaacaccatcagccgcacgctgattga 

cgagtgccagcaggtgctcacgttatgtgaggagcacgccaccacggtggtgctcgaaggcctgcc 

acacgtgttctgcatgggcgcggattttcgagccatccacgaccgggtcgacgacggccgccggga 

gcaaggcaacgcggagcagctgtaccggctgtggctgcaactggcgacaggcccctacgtgacggt 

cgcccatgtgcagggcaaggccaacgcgggcggcctgggcttcgtcgccgcgtgcgacatcgtgct 

ggcaaaggcggaggtccagttcagtctctccgagctgctgttcgggctgttccccgcctgcgtgatg 

ccgttcctcgcccggcgaatcggcatccagcgggcgcactacctgacgctgatgacgcggcccatcg 

acgcggcccaggcgctgagctgggggtrggcggacgcggtggacgccgatagcgagaagctgttgc 

ggctccacttgcgcaggctgcggtgcctgtcgaagccagcggtgacccagtacaagaagtacgcct 

ccgagctgggcggccagctgctcgcggccatgccccgggccatctccgccaatgaggcgatgttctc 

cgaccgcgccacgctggaaggcatccatcgctacgtggagacaggccgactcccatgggaatcatg 

acggaaggaacgccaatggcgccggtggtcacgctccatgaggtggaggagggggtggcgcagat 

caccctggtggatcgcgagaacaagaacatgttcagcgagcagctcgtgcgcgagctcatcaccgt 

gttcggcaaggtgaatggaaacgagcgctaccgcgcggtggtgctcaccggctacgacacctactt 

cgcgctcggcgggaccaaggccggcctgctgtccatctgcgacggcatcggctccttcaacgtcacc 

aacttctacagcctcgggctggagtgcgacatcccggtgatttccgccatgcagggacatggcgtag 

gcggcgggttcgcgatggggctgttcgcggacttcgtggtcctgagccgggagagcgtctacacga 

cgaacttcatgcgctacggcttcacgccggggatgggcgccacgtacatcgtgccgaagcggctgg 

ggtactcgctcgggcatgagctcctgctcaacgccaggaactaccgcggcgccgacctggagaagc 

ggggcgtgccttttccggtgttgccgcgcaaggaagtcttgccccacgcctacgagattgcgaggga 

cctggccgcgaaacctcggctgtcgctcgtgacgctcaagcggcacctggttcgcgacatccgccga 

gagcttccggacgtcatcgagcgtgagctggagatgcacggcatcaccttccatcacgacgacgtg 

aggaggcgcatcgagcagctgttcctctgaggcgcgcccctatgttgaaccxgatcaacaaccacgc 

acacggttatgtggtcacgcccgtggtcctgggctgcaacgacgctggcctgttcgaactcctgcgg 

cagggaccgaaggacttcgaccggtrggcggaggcattgcgtgccaaccggggacatctgcgcgtc 

gcgatgaggatgttcgaatcgctcggctgggttcgccgcgacgcggatgacgtgtacgcggtgacg 

gcggcggcggccgcgcatcggtccttcccccgcgaggcgcagtcgctcttcgcgctgcccatggacc 

GGTACCTGCGCGGGGAGGACGGCCTGTCCCTGGCGCCGTGGTTCGAGCGCTCTCGGGCGTCGTGGG 

aTaccgatgacacgctggtgcgcgagctgctcgacggcgccatcatcacgccgctgatgctcgcgct 

GGAGCAGCGTGGGGGCCTCAAGGAGGCGAGGCGTCTGTCCGACCTGTGGTCCGGGGGGGaTGGAA 

GGGACACGTGCGTCCCCGAGGCCGTCCAACACGAGCTGGCCGGGTTCTTCTCCGCGCAGAAGTGGA 

CGCGTGAGGACGCCGTCGACGCGGAGCTCACGCCCAAGGGCGCCTTCATCTTCGAGCGGGCATTGC 

TCTTCGCCATCGTCGGCTCGTACCGGCCGATGCTGGCCAGCATGCCGCAGCTGCTCITCGGTGACTG 

CGACCAGGTCTTCGGGCGGGACGAAGCGGGCCACGAACTGCACCTGGACCGAACCCTCAACGTGAT 

TGGGAGCGGCCACCAGCACCGGAAGTACTTCGCGGAGCTGGAGAAGCTCATCATCACCGTCTTCGA 

TGCCGAGAACCTGTCGGCACAGCCGCGCTACATCGCGGACATGGGGTGCGGTGACGGCACGCTCCT 

GAAGCGGGTGTATGA,^CGGTGCTTCGGCACACGCGGC^GGGAAGGeCGCTCGACCGGTTTCCGCT 

CACGCTCAXCGCCGCGGACTTCAACGAGAAGGCGCTCGAAGCCGCTGGGCGGACGCTGGCCGGGTr 

GGAGCACGTTGCCTTGCGCGCGGACGTGGCGCGGCCGGACCGTCTCATCGAGGACCTGCGGGCGCG 

GGGGCTAGCCGAGCCTGAGAATACGCTGCACATCCGCTCjGTTTCTCGACCACGACCGTCCCTACCAG 

CCTCCCGCGGACAGGGCGGGGCTCCACGCCCGGATTCCGTTCGATTCGGTGTTCGTGGGCAAGGCG 

GGCCAGGAGGTGGTTCCGGCGGAGGTGTTCCACAGCCTGGTGGAGCACCTCGAG : 
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CLAIMS 

What is claimed is: 

L A purified, isolated and cloned DNA sequence partially encoding a 
functional portion of a polypeptide component required for the synthesis of antibiotic 

TA 0 

2. The DNA sequence according to claim 1, wherein said sequence is isolated 
from Myxococcus xanthus. 

3. A purified, isolated and cloned DNA sequence consisting of a DNA 
sequence encoding a polypeptide component required for postmodification of 
antibiotic TA. 

4. The DNA sequence according to claim 3, wherein said sequence is isolated 
from Myxococcus xanthus. 

5. A purified, isolated and cloned DNA sequence consisting of a DNA 
sequence encoding a gene product involved in the regulation of the biosynthesis of 
antibiotic TA. 

6. The DNA sequence according to claim 5, wherein said sequence is isolated 
from Myxococcus xantkus. 

7. A purified, isolated and cloned DNA sequence consisting of a DNA 
sequence (Seq. ID No:l and 2) encoding a polypeptide component required for 
encoding the TA gene cluster. 
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8. The BNA sequence of 3eq. ID No:l and 2 altered by point mutations, 
deletions or insertions such as the resulting amino acid sequence is truncated. 

9. A transformed E coli carrying Seq. ID No:l and 2. 

10. A vector which comprises the DNA according to claim 7. 

11. A host cell, wherein the host cell is selected from the group of suitable 
eucaryotic and procaryotic cells, which is transformed with the vector according to 
claim 10. 

12. The host cell according to claim 1 1 which is E. coli. 

13. A recombinant expression vector comprising a DNA sequence according 
to claim 7. 

14. A cosmid containing the DNA sequence according to claim 7. 

15. A method of using the TA genes for combinatorial genetics. 

16. A-method of-usiag the TA genes encoding for the synthesis, modification 
or regulation of antibiotic TA. 
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TITLE 



ABSTRACT OF THE DISCLOSURE 



There is provided a purified, isolated and cloned DNA sequence partially 
encoding a functional portion of a polypeptide component required for the synthesis 
of antibiotic TA. Also provided are purified, isolated and cloned DNA sequences 
encoding a polypeptide component required for postmodification of antibiotic TA and 
encoding a gene product involved in the regulation of the biosynthesis of antibiotic 
TA„ A purified, isolated and cloned DNA sequence having a DNA sequence (Seq. ID 
No:l and 2) encoding a polypeptide component required for encoding the TA gene 
cluster and any mutations thereof is provided. Also provided are methods of using the 
TA genes for combinatorial genetics and of using the TA genes encoding for synthesis 
and modification or regulation of antibiotic TA. 
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Docket No. 
j 2290,00076 

Declaration and Power of Attorney For Patent Application 

English Language Declaration 

As a below named inventor, i hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name, 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, 
first and joint inventor (if plural names are listed below) of the subject matter which is claimed and for 
which a patent is sought on the invention entitled 

GENE CLUSTER 

the specification of which 

(Iheck one) 

£3 is attached hereto. 

H was filed on January 29, 1999 as United States Application No. or PCT International 

fy Application Number 09/240,537 

!f : and was amended on 

(if applicable) 

I Pgreby state that I have reviewed and understand the contents of the above identified specification, 
including the claims, as amended by any amendment referred to above. 

1 ^knowledge the duty to disclose to the United States Patent and Trademark Office all information 
kngwn to me to be material to patentability as defined in Title 37, Code of Federal Regulations, 
Section 1.56. 

I hereby claim foreign priority benefits under Title 35, United States Code, Section 119(a)-(d) or 
Section 365(b) of any foreign application(s) for patent or inventor's certificate, or Section 365(a) of 
any PCT International application which designated at least one country other than the United 
States, listed below and have also identified below, by checking the box, any foreign application for 
patent or inventor's certificate or PCT International application having a filing date before that of the 
application on which priority is claimed. 

Prior Foreign Application(s) Priority Not Claimed 



□ 

(Number) (Country) (Day/Month/Year Filed) 

□ 

(Number) (Country) (Day/Month/Year Filed) 

□ 

Number) (Country) (Day/Month/Year Filed) 
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I hereby claim the benefit under 35 U.S.C. Section 119(e) of any United States provisional 
application(s) listed below: 



(Application Serial No.) (Filing Date) 



(Application Serial No.) (Filing Date) 



(Application Serial No.) (Filing Date) 

I hereby claim the benefit under 35 U. S. C. Section 120 of any United States applications), or 
Section 365(c) of any PCT International application designating the United States, listed below and, 
insofar as the subject matter of each of the claims of this application is not disclosed in the prior 
United States or PCT International application in the manner provided by the first paragraph of 35 
U.S.C. Section 112, I acknowledge the duty to disclose to the United States Patent and Trademark 
Office all information known to me to be material to patentability as defined in Title 37, C. F. R., 
Section 1 .56 which became available between the filing date of the prior application and the national 
a? PCT International filing date of this application: 



(Application Serial No.) 



(Filing Date) 



(Status) 

(patented, pending, abandoned) 



(Application Serial No.) 



(Filing Date) 



(Status) 

(patented, pending, abandoned) 



(Application Serial No.) 



(Filing Date) 



(Status) 

(patented, pending, abandoned) 



I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that 
such willful false statements may jeopardize the validity of the application or any patent issued 
thereon. 
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POWER OF ATTORNEY; As a named inventor, I hereby appoint the following attorney(s) and/or 
agent(s) to prosecute this application and transact all business in the Patent and Trademark Office 
connected therewith, (list name and registration number) 
Kenneth L Kohn (30,955) 



Send Correspondence to: Kenneth L Koto 

Kohn & Associates 



30500 Northwestern Highway, Ste. 410 
Farmington Hills, MI 48334 



Qirect Telephone Calls to: (name and telephone number) 
llenneth L Kohn (248) 539-5050 



s Full name of sole or first inventor 
I Eugene Rosenberg 



i Sole or first inventor's signature ( 



-J* Residence 

& Givat Shmuel, Israel 



i Citizenship 
Isreali 



Post Office Address 
18 Rahavat Han 



Givat Shmuel 



Full name of second inventor, if any 

ElioraRon /"'/ 

— — l — "Date" 

Second inventor's signature / . y 



lafcre / '.J 

<X / * 9^ Lon Feb. 1999 



Residence 

Tel Aviv, Israel 



Citizenship 
Israeli 



Post Office Address 

36 Yehuda Hanasi Street 



Tel Aviv, Israel 
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Ci ii! noma *rf fHirrf inviantnr if any 
rUIi name OT uiliu inveniui, a any 




Elisha Orr 




1 niiu invciiUJi o otyi laiutc y r . f ^ ? 
__A — ~ 


Date 

Feb, 1999 


Residence 




United Kingdom 




Citizenship 




Israeli 




Post Office Address 




23 Greenhill Road 




Leicester, LE2 3DN, United Kingdom 



Full name of fourth inventor, if any 




Yossi Paitan 




Fourth inventor's si 9™^^^ 


Date 


Feb- 1999 


Residence /^z^ 




Risfaon Le^Zion, Israel 




;\j Citizenship 




.'S Israeli 




y Post Office Address 




41 Hertzel st 




O Rishon Le-Zion, 75296, Israel 



Full name of fifth inventor, if any 

— — — Date 



Fifth inventor's signature 



Residence 



^ Citizenship 



Post Office Address 



Full name of sixth inventor, if any 



Sixth inventor's signature 



Residence 



Citizenship 



Post Office Address 



J- 
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